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PCTAJS99/19069 



„SE OF POOLED PROBES IS OBNETIC 
ANALYSIS 

-.EMES. OP ^„ , 

iHGOl 323, and the 
this invention. 

„„V. o£ °li5°""'='"°"tt,^et nucl.ic acids of 
variety ™e..cds for ^''^^^^^ ^^^^^ .e novo se,u.noin. of 

, „,get nucl.io a=.d. Such ^^.^ ^ ^^-^'^^.^^ 

achieved by hybridizing a targe ^^^^^^ iaent.fy.ng 

,„ay of all probe ^-•I"^-"^ °' ^/^ to the target. Another 

sub.et of probes ^"-^^f^t^^^aLuication of X-ls 
application is ^>^^*^"«X;:ppUcations involve comparing a 
in a mixed population. Other PP ^^^^ ^^^^ 

:rr:rr:rce L the presence of 
differ from the retei variations. 
„„.ations, P=ly"°'^''^="=^;"',:, iaentifying variations in a 

R Simple strategy f°r discussed by 

.arget se^ence is the reverse dot^b^^^^^^ ^^^^^ 

oattagupta, EP 235. „,get nucleic acids with 
, £or comparative analysis of '■ J ^ „/ii,95 

reference nucleic acids are descr 

Cincorporated by -^'^^'^^'^ .ets. A fi"- ^ 

some such arrays '"^l-'^r '°"J"i„g a region of interest .n a 
Includes overlapping ^Tn the first probe set has an 

„ reference sequence, ^-^ jf ;^^^^„„,3 „ a nucleotide rn the 
i„„rrogation P-""" interrogation position rs 

-"r rtrcorrpon^i^g nucleotide in the reference 
aligned with tne ^ 
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, .ference sequence are aligned to 
sequence, when the probe and re ^^^^ ^^^^^ 

maximize co.ple.entarily ^^^^^/^^^^^^^^.^.i,, probes from three 
.he first lere are four probes 

additional probe ^^^l^ J .^e reference sequence, 

corresponding to p.obe sets are identical 

probes fro. the /,,3t probe set except at 

,o the corresponding probe f ro^ . ^ .^^^ ^^^^ 

.he interrogation P°--°- ^^^^^^^^ ^^..es fro. the four probe 
,n each of the four --spo^d n^^^^^ ^^^^^^^^^^ 
sets, and is occupied by a dx 

probe sets . ,K-ridized to a labelled target 

such an array xs hybr.dxzed ^^^^^^^^ 3equence, or 
sequence, which .ay be the -e ^ ^^^^^^^^^^ ^^^^,,3, 

. variant thereof. ///^^"/.......ed by comparing the 

in the target --^-^""r !l\^e four probes having 
hybridization i-^ensxtxes of t ^^^^ .^..^otide. The 

, further strategy for -^P^ ^^,^^^3 „ 

„lth a reference overlapping probee spanning a 

.trategy, an array ^^'^^ J/^f, fe<^ence . T.e array is 
region of i-^""^"" ' !„„et sequence, which may be the 
hybridized to a ^^^^''^^ fvariant thereof. « the 

sa.e as the reference sequence, probes 

target sequence xs a ^"^^ 3^„„ reduced hybridization 

overlapping ="e °f J--^ ,„,ys rn 

intensity relative ^„ fashion stepping 

0 which the probes are ^^^^ successive probe 

through the reference ^^'^^^^^J^. ..^e than its 
has one fewer base and „ ,„,3„3ity is 

predecessor, . ^^^'^.tTot to^s approximately centered 
^„i.ested as a .he target sequence and 

35 about the poxnt of varxa 

reference sequence. 3trategies described above, 

.Oh probe^sr::! array occupies a unique cell or 
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^ ^He signal bound by 

WO00/im3 arrangement, trie 

each probe i> 303-30-> USeS) dis«= 

se,5uencing by W ^^^^^^^des long, ^n ^^.^ j^ur 

central pos'^ ^^^aj, a cei ^ sequence. 

nucleotiae " „j ,i«een probes °' «^ 

,c„p.e. bV a '"f"::,:::^ 3o.e arrays 

"rilsiaons. V.0 '^/""^ Zs. Poolea probes 

central posi" ^es o£ probes. „ ^ common 

containing P°°^^X complementary to a 

^eve component except at one or a I ^^^^ 

segment of a target se^^ ^^^^^ .^^to aet" t variations 
within the P"''^';^^ ,,,eral strateg.es " J^' fhese 
' '%rence relative to a reference «J 

" = ""^'"are^^ts can have advantages m re ^^^^^ ^^^,„ce. 

TaSy " ' 

0. ™k --y^Tprynucleotide 
,He invention provides arrays P^ ^^^^^ . 
,„bes bou^ to a s.P^rt -,\3ving at least three 

^ ^. ^-rravs comprise w-ars a po°^ ° * 

some such arrays ^^^.^^ tears ^es . ^ 

discrete regions- ^ ^^^^..^ing first and seco ^^^^^ 
,3 polynucleotide P^-s c ^P^^^ ^-^^ ^f;,,, tirst 

second region bear ^^^^^^ ^.^be w 

and a third region ^^^^^ .a p 

probe. Xn some first and secon 

.fively complementary arrays, the 

respectively sequence. 1^ s ^^^^^.^ ^^,d 

first site and th ^^^^ ^.^e. first 

3^ -—''i::r arrays, the sec^d -n.\olyn.cleo-^ 

. «ith a third probe as - ^^'^ second probe «th 
probe «ith bears 
probes, and the 
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wooo/lim polynucleotide probes, m 

fourth probe, as ^^''f j;,',: oLple^entary .c a second 
so^e arrays, the ^^^'^J^^^ polymorphic ""'^^/^^^ 
poly.=rphi=J-;^^-;^„„ - ' 

'"""ttrpolymorphic site. ^ plurality 

irarray comprise = -''"/f^^, diHereht 
the different regi gecona 
oe discrete -^i-- ;^, prohes co-^^-^^;;/:', t ,„,et 
pools of P-f \o nonoverlaPPWS «g™ents ot 

^-^^%r r :rsu=h arrays, ^r,:-"- 

:rt«.et secr^ence include ""'^ ^^e pectiv.lV co^ie.entary 
r first and second probes are P o^hic 

Ihic torms of the first and seco P ,o 

'° tThfdl""- --'^"^c fot: the different 

sites, tne ^ . ^„ nf Dolyrnorpnic i-'-' i:^-rtns In 

,,„erent -n*^-":;: fo^^Jtion of P°^V™=rp^reast a subset 
pools ^"'"-^J-^,;:: p„be is the sa.e .n at leas^^ 
some arrays, the ^^^ond probe 

of the Pl""^''''."' ;r,^set. in '"r^oles Each 

different pools .n t^e ^ ,^.ets of probe- 

„ p„bes -"-""^/rubset Of pools has a co^on ^^^^^ 

pool in the f irst ^^^^ s. ^^^^^^ 

a different secon F ^ different 

pools has a differs between the first 

the -"^"Z^^^^, ,„,«t of pools. , 

" Z invention -"^X- "/io/s. ^ first 

— tirorx--- -Tfrt"trs:=r' 

"rUbr^ a first 't:s: Prde -ithout the 

second prob ^^^^^ present at a second 

3„ probes. ^ se ^^^^^^ ""^/o^ote 9reater than first 

-ac tnola^ rati 

— "^".he invention ---rftrs tftUet nucleic 
,«erminin. Un-^e of PO^V»orph.c 
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WO 00/11223 hybridising a diploid "^^e^ 

nrtsing = support having at lea «ue probes 

array c""'"""/^, „gi„„ bears a pool o£ P ^ £or» 
regions- A f.rBt 9 „„pieBent.rv "//„j,r=o^le".entarV 
comprising a first P ^ ^e^ond pro ^^^^^ 

- - ^"^ir/o" tbe secon. pol^-^ a 
to a polymorphi^c £ ^.^^^^^ the sec 

„gion bears tba *-st p^^^_^^ ^^^^ «"7!^X,'o^ the target 
third region ^"" '^^^^^^e a ratio °' ^f^J^, „a third 

.econd VO^r°Z^^^ nucleic acid. ^^,,,,3 o£ 

■ ^^^^"UTnvention P-^^" tt:!: in a target nucle.c 

determining Un.^e - -^-^..^.g a dip^- 

. Tctdtvtn Urst and -t^l^^, discrete 

nucleic acid n ^t having a t pools of 

array -"^^"^^l/erS regions ^"^"^^ td second probes 
regions, f «%„^e comprising ^-^^/f^^, the first 
probes, a pool ot P polymorphic t comprising 

Respectively -"^""'""rtes , the different pools = ^ 
Td second P^^vmorpbic sit^^^^^^^ thl dtsfrete 

„ probes complementary ^ ,cid to - ^.^^^^^^ 

■ "Tn dlteUed to i^-'f/^^l^ran an average of 
"^""%;:t b'ds more target nucleic a« ^^^^^^^^ 
region that bl ^j^, discr 

target nucleic '"J^ a pool of P 

, control mi«ur« site an^ a se ^nd 

35 form at the f irst P ^^cond Pfj^ f^^^crete 

having a PolY™"'^' the mixture to ... 

screte 

determining "Vbridi^^ to the 

regions. Binding of th 
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wooonlffl then coi.«.ares binding o 

fhen determined- "n discrete regions to 

regions is then ^^^^^ „ ^he target 

„,,et nucleio aoid ^ """t/.icn bearing a 

,aenti«y = .":;:rtbe control, tbis --«-;/;::::,„,tion of 

::rr. ;:::^rr:spe«i^^^^^^^^^ . tbe di.oid 

, <c acid, such methods en ^s^rate having 

. target h^^"^ J, „ an array pools of 

"'-^ uf;- -Ute regions di«eren^ 
a plurality oi. . ^ common first f ^o a known 

rtiVet --^rtargersererls^ -^^^ — 

of the segment m .Sizing a target 

" - -TthTrr^encing ^^^^I^ZL a 

■0 ---^ rrer:urs%-;- rre 

plurality oJ „i,„ein the P 

aifterent ^J^^ ^..array of pools, eac P ^ 

^''^ '"l^ray ot POols having ' --fj^eld subarray of 
tirst subarray the ^ 

„ different second P«^ ^^^^^ ^tTo^ "robe , the co»^n 

^ola and a ^i"";"/ !,:i3 ^eing co^le^entary 

^arlcer m the 9 ^^^^^ subarray of poo ^^^^^^^ 

:rr^-« -=-;rt r ercord%eg»=ht cf 
30 pcou. ^ "-rrn r-i- Tsranr=^- 

nucleic acid is t ^^^^ " in the target . 

nucleic acid to th P "^t^ trpositions of the 

position of f 1 ^^^^ „,,ave to the 
nucleic acid is tn monitoring 
„ .no»n .ar.e» ^„,ner P-vUe/ entail 

TnRNA population. Sucn molecules, 
egression ot ^ f;„,;,l3ing a population of 
providing a sample 
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WO00/n223 .^3 copied therefrom is 

The population o. « ^f^l^^^.^n. a support havin. a 

.h.n hybridi2.d to an array c P ^^^^^^ 
plurality of aiscreta "9-"- th „n*rising first 

'different pools of ^-f/^^/^.^^f^^ieLntary to noncverlapp.ng 

„a second .probes re^ct.vely ^^^^^^^^^ 

segr,ents of a knovm m „„pi.^„tary to 

cofprisin, first -^J^/^fJ^I^e^ent Hnown -lecules. 
nonoverlapping "3"ant= 3p„.f.= 
0„e then aater.inas wh.ch dxscre ^^^^^^^^^^ ^^^^^ 

binding to the P^^''^"'"" f;;,„/i„ the sample. 
Mown mBSR ^lecules are P«=e« ^ ^^^^ further comprises 
in some such methods, different regions 

. second plurality o^^^^^^" fa ; pool having the same 
tearing different pools of probes. ^^^^ 
nrst and second probes except Jcr a ^ „^ ,,,1 ,„m 

.he first or second probe or both as further 
Z plurality of discrete "^^^^J^^,,^, pools of probes 

Lrises comparing binding of corre P ^^^^^ „gions, a 

rm the Plurality -r^^/^.^^^- ^own to which 

— :rii::"^^ - 

Cre-tary is P"- in ^ of analyting 

The invention ^""''er P hybridizing a 

, „„et nucleic acid -f^f/ot Js-g a support having at 
„rget nucleic acid to an array ^^^^^^ , pool of 

least three discrete "9— ■ second probes, a 

polynucleotide probes ^^""f „,,hout the second probe 

LcS region bearing ^^^/^J/^/^..^, p„be without the first 
„d a third region ''''""^ 'he s 

one then "mpar" bitf, aggregate of the target 
to the first discrete -e^-^^^^^^, third regions to 
„ucleic acid t^r/^'Icleic acid includes segments 

determine whether tj^e targ ^^^^^^ ^^^^ 

coinplementary to the 
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. „hotopl°"'* , array- ^ .^^s surfaca 

over °" , Met.fO=-l'* Mli aeprotec"*^^' 
lrradia»»9 .a « ^at ^aW . TY>e d ^^^^ 

' a..anaara ua*'^ 'pped. , used « 

, co=perat-j;va '«t;:;r ar. ^" 

^ifferen on entary ether b^^^ 

::aX' r --^^^^^^^ 

or li^*^® ^ strong . suw ^ ^ tne 

^«-";:a" ar,a" ^ t^^^^tt ^= 

targets t of a ^' ^goO ^ 

central probe 2 v 

-^^^'ve rectangles, 
successive 
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WOOoniM^ to the probe 1 

x.OO rectangles P-'^-^\'f^^ substrate. The P.obe 2 
" °s on the same region of the rectangles . 

rectangles °^ . f.. C. O, and T in ^ 

central P°"'-;/j„,, oontainea ^^^^^^^^^.s. 

r.rrerco^--"''^ 

different „,«tive vs. non- 

nuorescence images "^^ays. The design 

e^a^iv. h^^ridi^ation to P^^^^J^l^^^. .yhridi.aticn to 
cooperati ^howi in Fi9- 5- „hovm £or four 

°£ the * probe pair - f7„,.„,, ioc-2^g. 

the double e pairs (10g-"c, 

ai«erent ^^f^Y^^,, left hand "f^, in the 

"-r^hHor "spending unlinked targets 

""'"nl rfght hand colu™>. 

adjacent rign 10g-2-'c) and 

3„.S0 fixtures oe '^-";^;:t;;anels ot the left 
* d lOc-2-70) are sho«n in the t P ^^^^^^^ 
aog-27t and 10c ^'^^""^he P-»"^ " 

nand column, cowositicn. '^/J^^eHment, which 

,,e identical in -^"^ ,,,,oted m the ej ^^^^ 

The botton, panel xn tb ^^^^ ^70 ^^^^ 

--^^:C:sirnt::al to tbe t.o .^er Pa^^^^^ are 
composxt.on this case tbe ^^^^^ 

- ^tJerana .ence no cooperative e..ect 

• n and assignment of 
, cooperative -y-^^f;";;3 ^45. and 203B 

^^3- n SNPs separated by 693, ^ 

30 ™'='=°" differ in sequence at P .„ ^ig. 3, 

amplicons that d ,,,4 as de=cr ^^^^ ^^^^ 

The »r«ys „e 30-mers "^^J^^^, ,„„eoted 

except that the P ^^^^ ^,^,1. "'tferrogating positions 

from the 3 - ,„ey int« 9 ^^^^^^ 

,5 intensities from ^ amplicons. The linages 

can be assig 
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WO00/.12« positions 1424-1446, 

array is coinplementary to p 
probe 1 on the array positions Sin-Zl'l- RiS" 

p„be 2 is ""^1^"""'""/" hybridisation. B) 
discrimination plot of the ,,,e 50 -.50 target 

interrogation of positions " „obe 1 complements 

n-foi^Trr: TcipreU ^-^rs-:;- Xt 

-"irr Jation of -"--^r^in"; ^ 

contains ,1,-^"^ - ^„,..ons 211.-2141. 

■ .... different anchor probes »ith 

, g Effect of pairing six a tilings 

Itst^encin. ^ PrP^fs tp thro.h a^re.ion^ 

Shown, 86 colutTa.s of over PP ^ incren^ents " ^ 

cotnplen^entary P-°^\"^ ^n;, sequence. The seven 

-rit:rrtrs\r.nu;^^^^^^^^^^ :^ri: . 
^-i:::^X -.ni: t.::: it-r 

I^rToL: cU-"— "ly are - i-- fi^re, . Th 
S4,%.--, '-"/rrhtl silence overlaps the l=-mer 
^ irr:;rn;r/-erce are online, in «hite. 

, , comparison of si.a. -^r^retf- ^ ^ 

'Xaan. unpaired array desi.e^ ^ .^beled ss^. 
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;ai;ed ana unpaired array designe^ ;;,,,„.,,^led ssOH. 

mitochondrial mK^ J^^^J,^, ^paired «se^en=in. 
ampUcon was hybridized " of perfect match probe 

°' 'trrerr-htative portions of the array 
ritrter:: p:s::ton m the ---r^^ved^rom positions 
tchor se^ence in the P-;; «-^„ eubstantially 
,,a,.l446. Signal and discri ^^^^^^^^ 
increased in the paired array 
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^PQXvribonucleotide or 
nucleic «id - .^ f;X„gle.or double-.tvanded 
...onucleotide polymer in e^^- = „„,,,,.iaes uni... 

',o«, including .no«- ana.cs 

,,herwise i"'^^""'- ,,,,iae is a single-stranded nuclei 

" "''Tin to about 500 bases. 
ranging in ^^"f ,y„.hetic or """"^^^ ^^^aing to 

.Ugonucleot.de ean^ an oligonucleotide ca^^- "^^^^^^ 

aet nucleic acid o£ '-^^'^""'"J ^^Z^^^ complementary 

-a^fied bases (e.g-- ' tjrobe may be Do^" 

°' le bases in °"9on«leot.de pr ,^ 

.edition, thej ^ Phospbodiester bond. 

^ ""'rer/ere «itb ^bridi-^i- „„„i„ent 
not i«««" iae nucleic acids in wh ^^.^iester 

^"^-r iot/el '.V Peptiae --"-"fro-fo bases long, and 

" " 'p„bes are typicaUV ^ ,3,^ as 

linlcages. « .° Z iist.l segments of a 

tr.r-.rr ^^^^^^^^^ 

stringency to mere 
' ^""^ matched pools of probes. binding, 

-^--"^'tciUc >.V-i--- re^e U " » -rt-ler 
auple.ing. or ^^^^^'"^ °/„:„,ent conditions wben that 
nucleotide ee^-^ "^tcolplel mixture .e.g., ^o»l 

" '-r ,"ror;r ^t^irta^e 

:rr w^- a p.obe -1 bv^r-"^^^^^ ^ditions are se.ence- 
:^:%o no otber se^ence- -'^ ,,„umstances . 

dependent and ^"^^^'^^ specif icaUV at bigber 
3S Longer sequences hybridi ^nditions are s^ 

„„peratures. ,,.rm.l melting po.nt ^ 

be about 5.C f ^",,,,„ed ionic strength and pH. 

specific sequence at 
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WO 00/11223 Strength. pH. and 

,s the te.pera.ure (under ^et-e^d ^^^^^^ 
nLleic acid concen^at- at^^^^^^ ^^^^^^^^^ target 
complementary to the ^-^^^^^ ^/f^^ ,^,get sequences are 

stZence at equilibrium- As the ^^^^^^ 
Orally present in excess, at lin conditions 

generally P . Typically, st J ^^^^ ^ ^ ^ 

occupied -^^f ^^^,,^,„tion of at least abou 
include a salt con ^^^^^^ at pH 7.0 t 

ion concentration (or othe ^^^^^^^ (e.g., 

^ ^ at least about 30 C tor achieved 

temperature is at conditions can als 

Ltraalkyl ammonium salts. Fo ^ 

I r^ 50 tnM NaPhosphate. 5 rm ^ific probe 

(750 mM NaCl. 50 ^^.^^^^^ allele 

temperature of perfectly 
.perfectly matched ^X^:^- a p.be 

--^Tari"pr./ecC::^P--- ^%LTer mismatch 
r^r^^^e^o. .be --rce Tdeliberately selected 

KP" refer to probes whose sequ ^^^lar target 

3 probe rete complementary to a P anywhere m 

" Tlthlu/h the mismatch(s) may be 10- ^^^^^^^^^ 

^^^Tmatch probe, terminal ---f;;;/;,,, .y.ridization of 
,l,e is less likely to pre ^^^^ 

a terminal -^^"^ probes are often ^^^^ 

.5 the target sequence ^^^^ ' e t^e duplex 

the mismatch locatea destabilize tn 

with the targ 

conditions. . «»vker or site is tne 

- ^"^^°^r.rrer.arHe« have a. leas. «o 
alvar^ence occurs. P«'a g„ater than 1*. 

tueles, each =ccu«^9 . eelectea 

P"";-r;r;r;h.c locus -V - - -f,r^en. le„,.h 
T.''"-lV.o^'^= -^^"^r^"? "n-::. eats .V».H. s, . 
" V^or^hiCs. -'^^^';:rsr«U "es, .inudeo^ia. repeats, 

i;rrvar.a.U regions „peats, s.^le 

trinucleotide repeats, 
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. and insertion elements such as Alu. The 
sequence repeats, and insertion designated as a 

.J allelic form is aroitrarixy ^ 

first Identified allel designated as 

.he reference for. and „„„ring 

aiternative or ^^^^^^ ^ t.tL is so.eti.es referred 
„>.st fre,^ently xn a select P P ^ homozygous 

" as the «i^^;VPe 'o™^ Piplo g ,,,^.rphis™ 

'""rr r rta p^^^orphis. has three forms. 

,,3 t»o fo»s^ * ^^^^^^^ .el^orphis. >SNP. occurs at a 

, orphlc site occupied by a single nucleotide, which xs the 
polymorphic site P sequences. The site is 

aite of -«»"7^^^';j^„,,„„.a by highly conserved se^ences 

7t: :::^tl °^ 

"nTin:Ltc!:::rd:To:U-" usuauy anses due 
,o substitution Of one --ct«e for another at^^ 

A transition is tne repxa 
polymorphic site. A ,^.„Hdine by another 

r ^""rsi:: :rtr::rc:meit of a punne by 

pyrimidine. A trans e „„,i,„,iae polymorphisms 

^ ''1:farise frora deletion of a nucleotide or an insertion 

:ra"rci:rtrde relative to a ^^^^^^2:^ „ _ 

pool probe mixture is a mixtur 

probes occupying a single ^^--^J^^^^^f.rrown! the 
the identity of each probe^n the ^^^^ ,,,ressable. 

individual probes in the P°°Yfrom a cell bearing a pool 
.bus, the -^^^^^X::^^ Hhtt Of the different pxobes 
probe mixture is the aggres 

occupying the occupied by a first 

mien one " " ,,„nd probe is 

probe without a "^'^^ff/.^Tl hough trace amounts of the 
entirely absent from the eel ^^^^^^^^ ^^^^^^^^ 

second probe (e.g.. less tnan 

. --rirgeTsrurrir:: -.i. as.- 

.he preferential as»-a-n of a Pa«i-ar - ^ ^.^^^^ 
roro^l lortrorrr^fre^ently than expected by chance 
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for any particular allele frequency in the population. For 
«a^le, if locus X has alleles a and b, «hich occur equally 
frlXn ly. linked locus Y has alleles c and d, -h.ch 
occrequally frequently, one would expect the cc^inat.on ac 
to occur wi.th a frequency of 0.25. If ac occurs more 
frequently, then alleles a and c are in linkage 
disequilibrium. Linkage disequilibrium may result from 
'«:ral selection of certain combination of alleles or because 
anTuele has been introduced into a population too recently 
to have reached equilibrium »ith linked alleles. 

A marker in linkage disequilibrium can be 
oartlcularly useful In detecting susceptibility to disease (or 
Srphtnolype, notwithstanding that the marker does not 

..use the disease, .or "^f ^ ^tsTa b^t w ch Is in 
itself a causative element of a disease, 

e dlse^mbrium with a gene .Including regulatory 
(y) that is a causative element of a pnenocype, 
rtsra a t c ed o indicate susceptibility to the disease in 
rirc^lttances in which the gene y may not have been ident.f.ed 
or may not be readily detectable. 

DETAILED DESCRIPTION 



^-'^he invention is premised, in part, on the result 
that two different probes In a pooled mixture of probes can 
r Itaneously hybrld.e t^^^^^^^^^^^^^ 

:XgtfT::r;:t\ra u 

than the sum of binding of the target to ^''^ '^^XLr 
separated In Individual cells in an "^^^^^ J"^;f,,'":pectlve 

«v r^f f he two probes can bind to its xesp^v-w 
arrangement, each of the two p ^^^^^ 

complementary segment xn a target s qu 

irr:.:"- -^^^^^ - - -r" 

r^srv ral methods of analysis that f^e , 

.ale target molecule containing two segments of interest, 
Tdlwo tXet molecules, each containing one of the segments 
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Of interest. Applications include increasing the specificity 
of hybridization in mutation detection and gene expression 
monitoring applications, determining SNP haplotypes, 
characterizing repetitive sequences, such as short tandem 
repeats, and aiding contig assembly in sequencing by 

hybridization (SBH) . 

In a simple illustration of such methods, an array 
of probes is designed having three cells. One cell contains a 
pooled mixture of first and second probes respectively 
complementary to first and second segments of interest in 
potential target molecules. A second cell in the array 
contains the first probe without the second probe, and a third 
cell contains the second probe without the first probe. 

The array is initially hybridized with a control 
equimolar mixture of first and second target molecules 
respectively containing the first and second target segments 
(but not both) . Typically, the target sequences are labelled. 
Binding of target to different cells in the array can thus be 
determined by scanning the label. The binding is separately 
determined for each of the three cells, and a ratio is 
calculated for binding to the first cell (pooled probes) with 
the sum of binding to the second and third cells. In 
idealized circumstances, since cooperative binding of pooled 
probes to separate target molecules is not possible, one might 
expect that the ratio of binding might be one. However, in 
practice, the binding ratio often varies from one due to 
factors such as variation in probe deposition between cells 
and steric crowding of probes in the pooled cells. Thus, a 
normalization factor can be calculated to convert the observed 

binding ratio to unity. 

The array is then hybridized with an unknown target 
sample, which may be either a single target molecule 
containing both first and second segments, or two separate 
molecules, the first of which contains the first segment only, 
and the second of which contains the second segment only. 
Again, the ratio of binding of the target to the first cell 
relative to the combined binding to second and third cells is 
determined. If appropriate, the normalization factor is 
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applied. A ratio of binding (normalized, if appropriate) 
greater than unity indicates that the sample contains a single 
molecule containing both first and second segments of 
interest. Typically, a single molecule containing both first 
and second .segments give a normalized ratio at least two fold 
higher than a mixture of separate molecules, each containing 
one of the segments. 

In some methods, multiple cells in the array contain 
different pooled mixtures of probes. Such arrangements are 
particularly useful for analyzing targets that can have 
multiple combinations of segments. For example, a target 
nucleic acid having two polymorphic sites, each of which has 
two polymorphic forms (A/a and B/b) can have four combinations 
of polymorphic forms AB, aB, ab, and Ab) . To analyze such a 
target sequence, an array is designed with four cells each 
containing a different pool of two mixed probes. The two 
probes in each pool are designed to be complementary to one of 
the combinations of polymorphic forms (i.e., A'B', a'B', a'b', 
and A'b', where primes indicate complementary sequences). 
When such an array is hybridized with a target sample 
containing a single target nucleic acid, the pool of probes 
having both component probes matched with the target nucleic 
acid shows highest binding, two pools of probes having one 
probe but not the other matched with the target nucleic acid 
show intermediate binding, and the pool of probes having 
neither probe matched with the target shows the lowest 
binding. 

When such an array is hybridized with a mixture of 
two target molecules containing different combinations of 
polymorphic forms at the two sites, as in a sample from a 
diploid organisms, a different pattern results. In this 
situation, two pooled probes have both component probes 
matched with the same target molecule and gives the highest 
binding signal. The other two pools may have only one 
component probe matched with a target segment or can have both 
component probes matched with segments on different molecules 
of target sequence. In the latter situation, the two 
component probes in each pool are matched with different 
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molecules of Mrget sequence, a»d cooperative binding i= not 

tib e Thus, the binding o* pooled probes not matched „xth 
Tsingle target molecules is lower that for the two pools of 
orobes having both probes »atohed with the same target 
CZl The Observed relative binding of the our probe 
"hus serves to indicate which of the co.bxnat.on(s) of 
polymorphic forms are present in a target sample. 

^-^^'^^^^ft^eguence is either a .nown se<r.en=e or a 
variant of a >cno«n or partly Known reference se.^ence^ A 

—rsr^er^^^^^^^^^^^^^^ 

B Va-rs:- r:: ^-rdeTumrn :ir 

Tciated Tth ginetic disease. Examples of such genes 
associated witn g , „ « r- and K-ras, cytochromes 
include BRCA-1. BRCA-2, p53, N-.C ^ . ^ 

r>.^n CFTR HLA classes I and II, and ^-globm. 

^^^'^^^^^^^^^ ,,,, can be genomic, or cDNA. 

i«« are usually subject to amplification 

, if th^ individual is heterozygous. 
^.Tam us are also often subject 

.™,lification is typically preceded by reverse 
I„ this case '-^^^^^'^ ,^,essed can be 

"°r" Tirdescrib d b" Lcnly owned «0 ../1«3, and wo 
. °L:rification^f an sample from a diploid 

■ al ie can generate two species of '-f^jft^^i:, L,^,^^. 
individual from whom the sample was obtained is heter 
at a polymorphic site occurring within expressed R«A. 
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The PCR method of amplification is described in PCR 
Technology: Principles and Applications for DNA J^plification 
"11 n A Erlich. Freeman Press, NY, NY, : PCK Protocols: 

A Guide to Methods and Applications (eds. Innis, et ^^'J 
LLmic press, San Diego, CA, 19S0) ; Mattila et al Nucleac 
"ids .es. 19. KcKert et al., PCK Methods nd 

Applications 1, 17 (1991); PCR (eds. H ' ch is 

press, oxford); and U.S. Patent 4,683,202 (each o whxch xs 
incorporated by reference for all purposes) . Nucle.c ac.ds .n 
a target sample are usually labelled in the course of 
Lolification by inclusion of one or more labelled nucleotides 
amplification cy attached to 

in the amplification mix. Labels can axso 
amplification products after amplification e.g., by end- 
TabeUing. The amplification product can be RNA or ONA 
depending on the enzyme and substrates used in the 
3„plif ication^reaction^^ amplification methods include the 
U.ase Chain reaction .C.) ^^e- ^d W.^ .eno.ics . 

560 (1989), Landegren ' ' Acad, 

transcription amplification (Kwoh et al . . 
ScnSA S6, 1173 (1989)), and self- sustained sec^ence 
replication (Guatelli et al., Proc. Nat. Acad. Sc. USA 87, 

19 )) and nucleic acid based sec^ence amplification 
CbA) The latter two amplification methods involve 

lelmal reactions based on isothermal transcription, which 
;::rcr:otrsingle stranded HNA (ss^A) an. ^ 
DNA (dsDNA) as the amplification products m a ratio 

rva;i::rorra:e!s can be incorporated into target 

nucleic acids in the course of ^^^f ;;-^°;^;;^;,f;L 
amplification, suitable labels include fluorescein o 
Z latter being detected by staining with ^^^T 
■A^r. =,fter hybridization. In some methods, 

:;rra r :rLt. ....... ac.a= . co™pa„. 
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diluted, if desired, prior to hybridization to equalize 
fluorescence intensities. 

■ Supports 



S.upports can be made of a variety of materials, such 
as glass, silica, plastic, nylon or nitrocellulose. Supports 
are preferably rigid and have a planar surface. Supports 
typically have from 1-10,000,000 discrete spatially 
addressable regions, or cells. Supports having 10-1,000,000 
or 100-100,000 or 1000-100,000 cells are common. The density 
of cells is typically at least 1000, 10,000, 100,000 or 
1,000,000 cells within a square centimeter. In some supports, 
all cells are occupied by pooled mixtures of probes. In other 
supports, some cells are occupied by pooled mixtures of 
probes, and other cells are occupied, at least to the degree 
of purity obtainable by synthesis methods, by a single type of 
oligonucleotide. The strategies for probe design described in 
the present application can be combined with other strategies, 
such as those described by WO 95/11995, EP 717,113 and WO 
97/29212 in the same array. 

Typically, the component probes of a pool are 
present in the pool in equimolar ratio. However, in some 
arrays, some pools have more of one probe than other. For 
example, an array can be designed in which one region has an 
equimolar region of first and second probes, a second region 
has the first probe and second probes present with the first 
probe in excess, and a third region has the first and second 
probes present with the second probes in excess. In general, 
a target bearing first and second segments coniplementary to 
the first and second probes shows stronger normalized binding 
to the first region than the aggregate of binding to the 
second and third regions. Some arrays also include regions 
occupied by single probes. Within such regions the single 
probe is substantially pure (i.e., at least 90%, 95%, 99% or 
99.9% or 100% pure on molar basis) other than for incomplete 
synthesis intermediates. 
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A .qvntH peis of pi-ohft arravs 

Arrays of probes can be synthesized in a step-by- 
step manner on a support or can be attached in presynthesized 
form A preferred .method of synthesis is VLSIPS™ (see Fodor 
et al 1991. Fodor et al., 1993, Nature 364. 555-556; McGall 
et al USSN 08/445,332; US 5,143,854; EP 476,014), which 
entails the use of light to direct the synthesis of 
oligonucleotide probes in high-density, miniaturized arrays. 
Algorithms for design of masks to reduce the number of 
synthesis cycles are described by Hubbel et al.. US 5,571,639 
and US 5.593.839. Arrays can also be synthesized in a 
combinatorial fashion by delivering monomers to cells of a ^ 
support by mechanically constrained flowpaths. See Wxnkler et 
al EP 624,059. Arrays can also be synthesized by spotting 
.onimers reagents on to a support using an ink jet printer. 
see id.; Pease et al., EP 728,520. 

The basic VLSIPS™ approach can readily be adapted 
.o synthesize pooled mixtures of probes. The component probes 
of a^ool are synthesized in series. Synthesis of a pooled 
:Lbes starts with a substrate covered with a P^-----J 
protective group. The group is partially removed by limited 
Exposure of substrate to light. The deprotected sites are 
c:^ped with a protective group that is nonphotosentive but can 
be removed by other means, such as a chemical solvent. The 
remaining sites are then exposed to more ^^^^^^J^^ 

remaining photosensitive -^^X^; ^TZ-s^^^^^ -il 
proceeds on the exposed sites in a step t>y sz v 
first members of pooled probes are synthesized. 
nonphotosensitive capping groups are then removed. Synthesis 
proceeds anchored from these sites in a step-by-step fashion 
until second members of pooled probes are formed. 

is determined for each probe in the array. For ^ 
labels, hybridization intensity can be determined by, for 
exampl;. I scanning confocal microscope in photon counting 



PCT/US99/19069 

WO 00/11223 21 
mode Appropriate scanning devices are described by e.g., 
Trulson et al.. US 5,578.832; Stern et al., US 5,631,734 and 
are available from Affymetrix, Inc. under the GeneChip 
label. Some types of label provide a signal that can he. 
amplified by enzymatic methods (see By making use of 
enzymatic methods of mismatch discrimination^^ . 

c. ^ Meth ^*^^ of Use 

T.'ir.Vaqp. Analvsis 
Grouping and ordering genomic DNA markers is used to 
make physical and genetic maps, and to establish the 
relationship between genotype and phenotype. Markers in close 
proximity on the same DNA molecule tend to be inherited 
together, because the likelihood of a recombination event 
between two markers decreases as their separation decreases. 
The quantitative measurement of this phenomenon and its 
application to genetic analysis, was pioneered by Sturtevant 
in 1913 ^ Since then, a wide variety of genetic and 
molecular mapping techniques have been developed and used to 
build genomic maps based on both genetic and physical DNA 
markers. Recombination-based mapping has been supplemented by 
molecular techniques, such as radiation hybrid mapping , and 
methods that are based on single chromosome copy analysis 
4 5 or allele-specific detection, ^^ Other methods for 
establishing linkage include direct visualization of markers, 
using, for example, electron microscopy and fluorescence in 

8 9 

situ hybridization (FISH) ' . , 

in the present methods, linkage is determined using 
arrays containing cells occupied by peeled probes, targets 
Jor Unkage analysis are typically known se:^.nces containing 
at least two polymorphic sites. The target sequences may 
may not have previously been characterized for » 
The polymorphic sites are typically from 10-100,000, SO-10,00, 

100-5000 bases apart. As indicated above, an array for 
analysing such a target sequence contains mixtures of pooled 
;::brs corresponding to different combinations of pol^orphic 
forms at the target. If a target contains two polymorphic 
sites, one probe in a pool is typically an allele specific 
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probe overlapping one of the sites and complementary to one of 
the polymorphic forms at this site. The other probe in the 
same pool is typically an allele specific probe that overlaps 
the other polymorphic site in the target sequence and is 
complementary to a polymorphic form at that allele. If a 
target sequence contains two polymorphic sites and two 
polymorphic forms are present at each site, there are four 
possible combinations of polymorphic forms, and four pools or 

probes are required. 

If a target sequence contains two polymorphic sites 
and there are three polymorphic forms at each site, then there 
are nine combinations of polymorphic forms and nine pools of 
probes are required. If a target sequence contains three or 
more polymorphic sites, linkage between all three sites can be 
simultaneously determined using pooled mixtures of probes, 
each pool containing three probes. If each of three 
polymorphic sites has two polymorphic forms, then there are 2 
X 2 X 2 combinations of polymorphic forms, and eight pools of 
three probes each are required. The hybridization patterns of 
these more complex arrays are interpreted in similar manner to 
that discussed above for single polymorphic sites with two 
forms. In the general case, when such an array is hybridized 
to a homozygous target molecule, a single pooled probe mixture 
has a combination of probes matched with the target molecule 
and shows the highest binding to the target . When the array 
is hybridized with two heterozygous target molecules, two 
pooled probe mixtures having component probes matched with 
target segments in the same molecule show highest binding. 
Other pooled probe mixtures show varying degrees of lesser 
binding depending on how many component probes, if any. are 
matched with a segment of one of the target molecules. 

In addition to pools of probes, arrays for linkage 
analysis sometimes contain additional cells occupied with 
single species of probes. These probes are of the same 
composition as the probes contained in the pooled mixtures, 
and serve as controls to assess cooperative binding of targets 
to pooled mixtures. If all probes in a pool are complementary 
to segments on a single target molecule, then cells occupied 
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by individual probes from the pool should «ho« specific 
binding to the target, but the aggregate of specific binding 
(normalized if appropriate to acco-odate differences in probe 
deposition among other factors) should be less than the 
binding to the pooled probe position. By contrast if all 
probes in a pool are complementary to a segment of a target, 
but the different probe are complementary to segments in 
different target molecules, the binding of the target to a 
pool of matched probes is greater than the aggregate of 
Lnding of target to cells of individual probes (aft- 
appropriate normalization) . The relative binding of a target 
to pooled probes v,ith the aggregate of binding to the 
components probes of a pool therefore serves to provide 
confirmation of target assignments based =■> -"P""-; °' 
relative binding of target to different pools of probes. 

Mditional confirmation of target assignments can be 
Obtained by comparing binding of intact target molecules 
lontaining two or more segments matched with a pool of probes, 
"th control substrates containing only one "f^"; 
optionally, target and control substrates are ^J;"-;"""^ 
labelled to allow simultaneous application to an array. 

and controls are also optionally diluted to e^a i ed 
iab!l intensities before application to the array^ 
target molecule shows stronger binding to a pooled mixtures 
p Jes matched with segments of the target than does the 
Lture of control substrates .with 

effects due to different amounts of target, if needed) _ The 
intact target and the mixture of control substrates bind to 
Zri:Zl.y the same extent to arrays occupied by individual 

^ mWMire The pattern of relative 

ndrng':rt:r:era /co ris'to pooL of probes and to 

tfdu oircnent probes of pools, thus can P-ia. further 
confirmation that Un-cage has been correctly assigned in a 

The principles that are used to analyze linKage in a 
single target se^ence can be expanded to design an array that 
can analyze linkage in any number of target sequences, 
i::iud n'g, if desired, all or substantially all se^ences in 
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the human genome. Such en erray contains a group °f 
cells- occupied hy pooled probes, optionally with add.t.onal 
cells occupied by single probes, £or each target sequence 
Each group of cells is then analyzed independently in the 
manner described above. 

^ General methods for using arrays o£ probes for 

monitoring expression of mRHA populations are described in 

: ; AosL and wo s.n^^^^. such -"--^ --y-r 

of probes complementary to target 

L Ib^ populations or an a^lif ication product thereof is 
^p^d to such an array, and targets of interest are 
!!ntified and optionally, quantified from the extent of 

eci c binding to complementary probes. Optionally, binding 
:r r t A,!bes .nown to be mismatched with the target can 
be used as a measure of background nonspecific b-^ing and 
subtracted from specific binding of target to complementary 

The present methods employ arrays having at least 

. cells occupied by pooled probe mixtures for expression 
some cells occupi y P 

" nlnt probes ar brth complementary to nonoverlapping 
component probes a ^^^^.^^ ^^^^ 

segments o the ^ „ ,,timize specific 

sequence of the segme ,£,ects and cooperative 

binding due to base f/f/ /.tected by 

w Air^r. effects- The presence of a target 

i c bi ng Of the target to pooled P-be mixtures 

rntaining component probes ^'^^--^"^ ::..Z::Zof 
ootionally such binding can be compared with binding 

x rd"--"^ - ""^^"^ rorrnt" 

p^cbes contains a mismatch with a target of interest^ 
Ending of mismat.ed P--- ^ ^ rrtched 
baclcground and can b« ,„ ,,„ei„g between 

■ ^""' cf'pfr ectly matcb a ,„bes and a pool of mismatched 

^ . s sCif ies that the mBNA to which the matched probes a. 
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matched probes is typically at least 1,2, 1.5, 2, 5 or 10 or 
20 times higher than binding to the mismatched probes. Use of 
pooled probes in expression monitoring methods can increase 
the ratio of signal to background and therefore result in 
greater sensitivity and/or greater accuracy in quantitative 
measurements of mRNA levels. Such methods are particularly 
valuable with complex mixtures of target sequence (e.g. total 
mRNA populations, total genomic DNA) 

g. Use of pooled probes in sequen ce analysis 

Target nucleic acids of unknown sequence can be 
sequenced by hybridization to an array containing all probes 
of a given length in a process sometimes known as sequencing 
by hybridization. The subset of specifically hybridizing 
probes in such an array is identified, and the sequence of the 
target is assembled from the sequences of these probes (see 
e.g., EP 562047). One potential limitation of such methods is 
that segments of target sequence commensurate with the length 
of probes typically used for hybridization assays recur with 
significant frequency in a target sequence. Such recurrence 
limits the length of target sequence that can be applied to an 
array, and/or complicates interpretation of the hybridization 

pattern of an array. 

The present methods provide a solution to this 
problem using pooled probe mixtures. In the present methods, 
an array of probes is constructed which comprises a number of 
subarrays of pooled probed mixtures. The pools in each 
subarray have one common probe, and one variable probe. 
Collectively, the variable probes in a subarray constitute all 
probes of a given length. The common probes vary between the 
different subarrays. The common probes are chosen to be 
complementary to known regions of target sequence flanking 
regions to be sequenced. For example, if one is sequencing a 
chromosome, the common probes can be designed to be 
complementary to known markers distributed at fairly regular 
intervals throughout the chromosome. 

Optionally, the target sequence can be fragmented 
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before application to an array, although any fragmentation 
should leave intact linkage of marker regions to flanking DNA 
to be sequenced. After hybridization of target sequence to 
the array, binding to the probes in the different subarrays is 
determined. A region of target sequence is then compiled from 
each subarray from the oligonucleotides showing specific 
binding in that subarray. Each region of target sequence read 
from a subarray is mapped as being in proximity to the marker 
complementary to the common probe included in pooled mixtures 
in that subarray. Accordingly, the present methods allow 
simultaneous sequencing and mapping of numerous segments of a 
target sequence. 

use Pooled Probes in Rsseoupncinq Array 

As noted in the Background Section, WO 95/11995 
describes methods of resequencing involving comparison of 
binding of four probe sets to a target. One probe set 
contains overlapping probes that span a reference sequence and 
are complementary to the reference sequence. The other probe 
sets contain corresponding probes for each probe in the first 
set, except at an interrogation position, at which 
corresponding probes from the four probe sets differ, 
corresponding probes are often arranged physically or 
conceptually as a column on a support. Binding of target to a 
column of four corresponding probes is compared, and a base m 
the target sequence is assigned as the complement of the base 
occupying the interrogation position of the probe showing the 
highest specific binding to the target sequence. 

Such methods can also be performed using pooled 
probe mixtures rather than single probes . The components of 
such probe mixtures can be viewed as first and second groups 
of probes, each mixture having one probe from the first group 
and one probe from the second group. The first group of 
probes contains the same four probe sets as described in WO 
95/11995. The second group of probes contains probes that are 
complementary to the reference sequence and allow cooperative 
binding with a partner probe from the first group. 
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Each of four corresponding probes in the first group 
is usually matched with the same partner probe from the second 
group. The partner probe is designed to be complementary to a 
segment of target DNA that does not overlap the segment to 
which the four corresponding probes are complementary but is 
sufficiently proximate to allow cooperative binding between 
two probes in probe mixtures. Optionally, the segments of DNA 
bound by the partner probe and the four corresponding probes 
can be immediately adjacent (i.e.. with no intervening bases) . 

Different columns of four corresponding probes from 
the first group can be matched with the same or different 
partner probes. For example, in some arrays, each of the 
columns of four corresponding probes is matched with the same 
partner probe. In other methods, columns of corresponding 
probes are matched with different partner probes such that the 
separation of target segments bound by partner probes and 
corresponding probes remains constant. Alternatively, 
different columns of four corresponding probes from the first 
aroup can be matched with different partner probes without a 
constant separation between segments bound by the probes m a 

pooled mixture . , . r «i o 

in all of the above arrangements, binding of pools 

is compared for four pools containing four corresponding _ 
probes from the first group. A base in the target sequence is 
identified as the complement of the base occupying the 
interrogation position of the pool showing highest binding 
comparison of pooled probes in this manner can offer a higher 
ratio of binding of matched to mismatched pools, thereby 
increasing the accuracy of assignment of bases in the target 
sequence . 

/ f) Moag^i-rina gA.rpiftnce Length 

The incremental normalized binding of a target to 
pooled p;robes matched to the target relative to the aggregate 
Mnding^o components of the pool is related to the separation 
of target segments bound by the probes. As the spacing 
increases, co-operative binding decrease, as does the 
incremental binding of target to a pooled probe mixture. 
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These binding characteristics can be used to assess the length 
of a segment of unknown length flanked by two know markers. 
For example,, the segment of unknown length can be a 
restriction fragment length polymorphism including di, tri and 
tetranucleotide repeats. 

An array is designed containing a pooled probe with 
two component probes complementary to markers flanking a 
sequence whose length is going to be analyzed. Also present 
in the array are control cells containing each of the pooled 
probes without the other. The array is then calibrated for a 
series of control substrates in which the known markers are 
separated by different known lengths of intervening sequence. 
For each control substrate, a ratio is determined between 
binding to the pooled probe mixture and the aggregate of 
binding to individual probes. This ratio can then be plotted 
as a function of intervening sequence length. As the 
intervening sequence length increases, the. ratio of binding 

decreases toward unity. 

The array is then hybridized with a target to be 
analyzed containing the two known markers separated by an 
intervening segment of unknown length. The ratio of binding 
of target to pooled and control probes is determined as 
before The length of the intervening segment can then be 
read from the plot or by computerized interpolation. This 
type of analysis is particularly useful in assessing the 
length of trinucleotide repeats which are associated with 
several genetic diseases, such as Huntington's disease. 

( q) Or ^^Af of Fra«=rmentS 

in genomic sequencing or restriction mapping, one 
sometimes knows that a target sequences is comprised of 
several specific component segments, but does not know the 
order of the segments. For example, one might know that a 
target sequence has component sequences a, b and c, but not 
know whether the order of the sequences is abc, acb, or bca. 
The order of the component sequences can be determined by 
hybridization to an array containing pools of probes, each 
pool having two component probes hybridizing to different 
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segments, the '-different pools differing in the combination of 
segments to which their component probes hybridize. 

In general, segments of a target sequence that are 
closer together show greater cooperativity in binding to 
pooled probe mixtures that segments further apart. Thus, for 
a target sequence abc. a pooled mixture of ab or be shows 
stronger binding relative to the aggregate of a + b or b + c 
than does a pooled mixture of ac relative to a + c. Thus, the 
relative binding of target to the three pooled probe mixtures 
relative to the aggregate of binding to appropriate component 
probes indicates, which target segments are in closet 
proximity to each other, and thus, the order of segments in 

the target sequence. 

A potentially interesting application is sequence 
reconstruction (contig assembly) in sequencing by 
hybridization (SBH) . One of the major obstacles to using 
short DNA probes for SBH is the presence of repeated 

sequences. De novo SBH utilizes an array of the set of all 
probes of length n (usually 6-10) to determine the presence of 
length n subsequences in the target of interest. These 
subsequences are compared to each other and aligned to 
reconstitute the contiguous target sequence. Repeated regions 
of length n or longer complicate the sequence reassembly, 
since it is no longer possible to unambiguously determine the 
linkage of sequences on either side of the repeat . It has 
been proposed that measuring the distance between reference 
points in a se^ence could increase the P°-f ^/ f ^^^^^ 
length an array can resequence by more than 4-fold . It is 
now possible to assess experimentally the distance between 
reference points by testing con^inatorially a large number of 
alternative probe pairings. For example, if there are three 
contigs a. b. and c, of possible order ^.^T^'^'^f ^ 

order can in principle be established by hybridizing the 
target to the pairwise probe combinations a-b and a-c . In 
addition, it may be necessary to compare hybridizations of the 
target fragmented to various extents, e.g. with restriction 
enz^es. ^ing the principle of Sturtevant ^ sequences close 
together will be more likely to remain linked. 
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EXAMPLES 

MATERIALS AND METHODS synthesized 

Oligonucleotide Arrays. DNA arrays were sy 

ina 5 • -MeNPOC protected phosphcramidites • A MeNPOC 
::::!cted txaetllene .lycol p.osp.ora.idite is coup.ed to a 

glass substrate which has been -^-^^^^ ^^,^,^,,3 ,3 

bis(hydroxyethyl)aminopropyltriethoxysxlane. The substr 

then expo^d to 365nn, light through a lithographxc mask The 
then exposes , , n j ^u*- that only the 

«e«POC protecting group is re^ve. by Ught, so that ^ 
„„=.d sites become available for coupling. Repeatea y 
oriight-dir cted aeprotection and nucleoside base coupling 
^ rhe efficient parallel synthesis of any desired 
allow the efficient P different oligonucleotides. 

v,i„»Mon of large nunibers of different oiia 
combination oi la 3 .v..<. paired arrays are 

paired probe array synthesis. Pairec a 
v^thesized in a similar fashion (Fig. 1) • The primary 
aynthesized in coupling the first nucleoside the 

alfference IS that^ior ^^^^^ 

a^ray !^,„uing in an approximate X:l mixture of 

protecting group, resulting ^ tected sites are 

-"rr' ::\To:^^rr:« -creLtd!. ^he ^ i= 

th "g LrphLodeprotection and synthesis cycles, 
r rt^lnm: «e»OC ^'-^'^ZJ^J^^-X n> . upon 

-t:^^:::;::::^ ^^^^^^^ - r::: 

of N, M Diethyl synthesized 
further elongation. After the iir v .„^„.,-t.d 
J OUT protected sites are deprotectea 

and capped, '■^^ "^J^l protected linker is coupled 

,3»TCA/DCM, 5 s . is then patterned 

T Ti e 'using mIkpOC photochemistry, the result is an 

Tay tth aTxture of two different ^ probes at every 

^.led Target.. DSA oligonucleotides bearing a 

»^T, label were synthesized on solid 
terminal fluorescein label were s,y 

' ur^r^s using ^^^^rz^^^:^^^-^- 

:tr::Thr::Ure";-.c=\ct cac g»g age tct oca t. att 
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Ngg tat ttt cgt ctg gga ggt atg cac gcg ata gca. where F 
denotes fluorescein. The letter N indicates positions 10 and 
27. The base at these positions is indicated in the name of 
each oligonucleotide. Likewise, oligonucleotides 10c and lOg 
are based pn the sequence 5'Fct cac gNg age tct c, and 27c and 
27t are based on 5'F tgc att Ngg tat ttt. The lOc, lOg, 27c. 
and 27t sequences were derived from the position 10 and 27 
double variants listed above. In addition to the 
oligonucleotide targets, human mitochondrial DNAs of 160 bases 
and 2.5 kb were prepared using single -stranded asymmetric PCR. 
These DNAs were amplified from samples previously sequenced on 
an ABI 373A DNA Sequencer. Labeling was by incorporation of 
biotin-16-dUTP during PCR. Two 2.5 kb amplicons were 
prepared, differing at three positions. Amplicon 1 had the 

sequence 93c-1438c-2131a. Amplicon 2 had the sequence 

93t-1438t-2131g. 

RESULTS 

Cooperative hybridization and linkage detection. To determine 
whether cooperative hybridization could be used to distinguish 
between linked and unlinked pairs of otherwise identical 
sequences, we designed a simple experiment to assay for 
linkage between a pair of 9-mer sequences separated by 8 
nucleotides. The principle is illustrated in Fig. 2. 

A 4 X 4 array was synthesized, in which each 400 /xm 
X 400 Mtn site contained a mixture of two different 9-mer probe 
sequences, Probe 1 (3 ' -gtgcN^ctcg-S ' ) and Probe 2 (3'- 
gtaaN2Ccat-5 ■ ) . To demonstrate that any cooperative effect 
was sequence-specific, we designed the array to include four 
variants of each of the probes, in which the central base of 
Probe 1 and Probe 2 was substituted with A, C, G, or T. The 
resulting array contained 16 sites. Each site contains a 
different combination of and for the two probes. In 
this way all sixteen mixtures of Probe l-N^ and Probe 2-N2 
were synthesized (Fig. 3) . 

Two sets of hybridizations were performed. First, 
physically linked targets complementary to Probe 1 and Probe : 
were hybridized to the array. If pooled probes could 
hybridize in a cooperative manner to two pooled probes, the 
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•„nal from an array containing two pooled probes would be 
"' .er than the sum of the non-cooperative hybridization 
' in the regions where only one probe perfectly matched 

sxgnals .n th g ^^^^^^^ ^.dependent targets 

the target. Secona. ^ „wture were hybridized to 

matching each of the probes in a mixture were y 

'.;r.:aizacicn signals in the regions only one or 

.„ereL -""^ r—^uirr. 

and probe 2 were used to ^°«="9' nuoresoence intensity 

,he paired hybridization (F.g. ■ J^J' „^ 
of the linked targets was always greater than 

-fs rn^rUrsTer ther:atched both 

"■^ 'frr berw -3x1he L oj the intensities where they 
paired probes were unlinked targets, 

matched lust probe 1 or Probe 2^ ^^^^^^^ ^^^^ 

„ fVi*» reaions where i-ne uexj.^^ 
the intensities - ^^^^ of the regions 

paired probes was 15-35% less tha ^^^^ 
Lre they matched Probe 1 or ^-^^ ' J^^^^ ,,„ce 
Signal may be due to ....... .here 

almost twice as much target P .^^^^^^ ratio 

"Th: rir: irsTd-sirgle^^^^^ changes was also 
between the correct ca ^.^„ets These results 

markedly better ^ linked target 

demonstrate ^-^^^^^^l" \ the linkage or 

ee^enoes to ^""^^f ^"„,,\,„nT dlstin^ish.d, and the 
ratrrblle: a^^"/.. w^e correctly determined in the 
physically linked targets. 

. of linhage in a heterozygous mixture. To determine 
Assignment of linkage 1 ^^^^ 

« ^V-"^-^^»."^^^"tn Cle. h re.„.ygotes, two further 
assign linkage directly in ^guimolar mixtures 

:7:rirnLrt:rre::r^^^ - 
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array. In the first experiment, the mixture consisted of 
10c-27t and 10g-27c. In the second experiment, the mixture 
was of 10g-27t and 10c-27c. Although the two experiments have 
targets that are identical in sequence composition, the 
pairing is different. The results are shown in the left-hand 
bottom two panels of Fig. 4. In each mixture, it was 
straightforward to assign linkage. In each case, the linked 
sequences are clearly discriminated from the other possible 
arrangements (e.g. c-c, g-t vs. g-C c-t) . Even though the 
probes in the four array positions c-c, c-t, g-c, g-t are 
complementary to equimolar amounts of target in the 
hybridization mixture, there is significantly more signal 
where the two probes are perfectly complementary to the same 
target molecule (1.4-7 x intensity). Furthermore, the control 
hybridization, in which unlinked targets have the same ^ 
sequence composition as the linked targets, shows lower signal 
and no evidence of cooperativity . These results show that 
paired probe arrays can be used to assign linkage in mixtures 
containing two different multiply polymorphic alleles. 

cooperative hybridization over distances of more than 2 
kilobases, and construction of SHP haplotypes. To examine the 
strength and specificity of linkage over a greater distance, 
we synthesized paired probe arrays with the probe ^equences 
chosen from different regions of a 2.5 kb mitochondrial DNA 
amplicon. The length of the probes was increased to 30-mers 
to allow hybridizations to be performed under more stringent 
conditions. Higher stringency was used to reduce the 
secondary structure in the targets and to favor cooperative 
hybridization by destabilizing individual hybridizations. 

Arrays of three different designs were synthesized, 
in each design a different pair of single nucleotide 
polymorphisms (SNPs) was interrogated. In each of three 
experiments shown, a 50:50 mixture of two 2.5 kb target 
a!^licons was analyzed on a paired array (Fig. 5 . The two 
2 5 kb amplicons are from the identical region of human 
„.itochcndrial DNA, but are polymorphic and differ from each 
other at the specific sites analyzed by the arrays. In the 
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tirst e:cperime«, S«PB at positions »38 and 2131, separated 
^rr«^=a-tid.s, were analysed. The second --P'"-« 

Tied SNPS 13« nucleotides apart, at positions S3 and »38. 
r hid e^eri^ent .^eried S»Ps 20,S nucleotides apart, at 
The third exp experiment, the correct probe 

positions 93 and 2131. in ea ^ results show 

,no have the highest intensity (Fig. 5). The result 
/la blse^isJtch discrimination and Uncage detection 
,,„gle "I'^r" ^i„„„3 up to 2.1 icb, although the 

:r<erenrs of" bases out of SO, as opposed to 2 bases out o. 

" " r Lrre'Thtn analy«d by comparing each probe 

• „ its 6 single base alterations (the 3 single base 
pair to Its 6 si g constant, and the 3 single 

changes in Probe 2. "^^P'^S constantl (Fig. 5, 

base changes in Probe 1, keeping Probe 2 ^^^^^^^ 

,l,ht hand colum... " this ,„es is even 

Unkaae assignments and the incoxicu 
,he ^ , rte discrimination chart means 

more apparent. A SCO ^^^^^ ^^^^ 

fhl ::::: InrenSty an its one base alterations. I.e., 
discrimination ^e ^ o-he ^robe ^a..g a-a -^^ 

1:;::::::^:.:;! :s::gnments are close to .ss th^^ 1,^ 

, .V ^r^•r^ect linkage assignments produce vaxue 
while the correct iini^ctu = correct 

AS with the 9-mer paired probe array, tne 
greater. As witn 93t-1438t, 1438c-2131a, 

linkage assignments (93c-1438c. 93t 

tn.na 93c-2l31a, and 93t-2131g) were easily 

unambiguously reconstructed. 

3...nc. .dependence /^tslnrltllltrrtr" 
' '-"T^X::r:.:Z::. t rsit>ed two more arrays to 
''" ticate cooperative hybridization with a much larger 
:re ^variLy of di«erent se^ences. The aim was to 
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determine to what extent the cooperative effect we observed 
was sequence -dependent to assess if it could be extrapolated 

to a larger set of probes. 

In the first experimental design, six different 12- 
mer sequences were each paired with a resequencing array 
containing eighty-six overlapping 15-mer probes. Therefore, 
each of these resequencing arrays contained a constant 12-mer 
sequence to act as an "anchor" for the target, which was 
queried by the variable 15-mer probes in the array. Each 15- 
mer was represented by 4 single-base substitutions (A. C, G 
and T substitutions at the central position) to give a total 
of 2064 combinations (6 anchors x 86 probes x 4 
substitutions) . Of these, 516 were perfect matches to both 
probes in the pair, and the remainder contained a single base 
mismatch at the central position in the 15-mer P^^^ 

A 160 nt fluorescein- labeled amplicon was hybridized 
to the array, and an image of the hybridization pattern was 
obtained (Fig. 6) . The array included an unpaired tiling as a 
control. Every paired tiling gave greater signal than the 
nontrol tiling. The extent of this improvement, m overall 
i^enlitrand discrimination, is related to the hybridization 
strength of the region represented by the constant I2mer 
anchor sequence. For example, positions 1-12 have very low 
intensity^n the control tiling, and anchor 1-12 has a small 
Effect Jn the intensity and specificity of the hybridization 
in its paired tiling. Positions 15-26 perform well in the 
control, and have a dramatic impact as an anchor sequence. No 
signal increase was expected in the regions where 
interrogation probe sequences overlap with the anchor sequence 
(outlined in Fig. 6). because each target molecule cannot 
hvbridize to more than one probe in these cells. 

The results showed that a variety of different 
anchors can be paired with the same resequencing tiling with 
■ -i^T effect- that a given anchor sequence can pair 

« ; ^ -'^^V ai«e„nt ee<^ences at various 

a Xnces iL t.e ancncr,. and that si^al enhancement W the 
anchor probe Is related to the hybridization strength of the 
anchor sequence. 
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To extend these observations and to examine probe- 
anchor interactions over greater distances, we next paired a 
single 20-iner anchor probe with a 20-Tner tiling array 
interrogating 2544 nucleotides of the human mitochondrial 
genome. Two separate arrays were synthesized. The first 
contained only the 10,176 20-mers (2544 x 4 substitutions per 
position) as a control. In the second array each cell was a 
mixture of one of the 10,176 probes and the 20mer 
complementary to positions 1427-1446 on the amplicon. 

An analysis of signal intensities versus position in 
the 2544 base sequence reveals the characteristic dip in the 
region where the anchor overlaps with the variable probe, and 
increased signal intensity and discrimination elsewhere on the 
array (average of I5x fluorescent intensity over the unpaired 
control) (Fig. 7). At separations greater than 1000 bases 
between the anchor and probe sites on the target, , a strong 
cooperative binding effect can still be seen. Under the 
conditions used, 97% of the sequence (2459 out of 2544 bases) 
could be determined by simply identifying the probe with the 
greatest intensity in each set of A, C, G, and T substitution 
probes, in contrast, under these stringent conditions, only 
84% of the sequence (212B bases) could be determined on the 
unpaired control array. 
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TABLE AND FIGURE LEGENDS 



Table 1 . Hybridization Experiments 
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Although the foregoing invention has been described in 
detail for purposes of clarity of understanding, it will be 
obvious that certain modifications may be practiced within the 
scope of the appended claims. All publications and patent 
documents cited above are hereby incorporated by reference xn 
their entirety for all purposes to the same extent as xf each 
were so individually denoted. 
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1 1. An array comprising a support having at least 

2 three discrete regions, a first region bearing a pool of 

3 polynucleotide probes comprising first and second probes, a 

4 second region bearing the first probe without the second probe 

5 and a third region bearing the second probe without the first 

6 probe . 

1 2. The array of claim 1, wherein the first and 

2 second probes are respectively complementary to first and 

3 second nonover lapping segments of a target sequence . 

1 3. The array of claim 2, wherein 

2 the first and second nonoverlapping segments of the 

3 target nucleic acid respectively contain first and second 

4 polymorphic sites, and 

5 the first probe is complementary to a polymorphic 

6 form of the first site and the second probe is complementary 

7 to a polymorphic form of the second site. 

1 4. The array of claim 3, wherein the second region 

2 bears the first probe with a third probe as a second pool of 

3 polynucleotide probes, and the third region bears the second 

4 probe with a fourth . probe , as a third pool of polynucleotide 

5 probes . 

1 5, The array of claim 4, wherein the third probe i 

2 complementary to a second polymorphic form of the second 

3 polymorphic site, and the fourth probe is complementary to a 

4 second polymorphic form of the first polymorphic site. 

1 6. An array comprising a substrate having a 

2 plurality of discrete regions, different regions bearing 

3 different pools of probes, a pool of probes comprising first 

4 and second probes complementary to nonoverlapping segments oi 

5 a target sequence. 
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3^ 7, The array of claim 6, wherein the nonover lapping 

2 segments of the target sequence include first and second 

3 polymorphic sites and the first and second probes are 
respectively complementary to polymorphic forms of the first 

5 and second polymorphic sites, the different pools comprising 

6 probes complementary to different combinations of polymorphic 

7 forms, the different pools differing in the combination of 

8 polymorphic forms. 
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8. The array of claim 6, wherein the first probe is 
the same in at least a subset of the plurality of pools and 
the second probe varies in different pools in the subset. 



9. The array of claim 6, wherein the pool of probes 
comprises first and second subsets of probes, 

each pool in the first subset of pools having a 
7 common first probe and a different second probe, 

each pool in the second subset of pools having a 
common first probe and a different second probe. 

the common first probe differing between the first 
subset of pools and the second subset of pools. 



10. An array comprising a support having at least 
three discrete regions, a first region bearing a pool of 
polynucleotide probes comprising first and second probes at a 
first molar ratio of first to second probes, a second region 
bearing the first probe without the second probe or with the 
second probe present at a second molar ratio of first probe to 
second probe greater than first molar ratio, and a third 
region bearing the second probe without the first probe or 
with the first probe present at a third molar ratio of first 
probe to second probe less than the first molar ratio. 

1 11. A method of determining linkage of polymorphic 

2 forms in a target nucleic acid, comprising: 

3 hybridizing a diploid target nucleic acid having 

4 first and second polymorphic sites to an array comprising a 

5 support having at least three discrete regions, a first region 
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bearing a pool- of polynucleotide probes comprising a first 
probe complementary to a polymorphic form of the first 
polymorphic site and a second probe complementary to a 
polymorphic form of the second polymorphic site a second 
region bearing the first probe without the second probe and a 
third region bearing the second probe without the first probe; 

determining a ratio of binding of the target nucleic 
acid to the first region and to the second and third regions 
confined to indicate whether the polymorphic form of the first 
polymorphic site and the polymorphic form of the second 
polymorphic site are present in the same molecule of the 
17 diploid target nucleic acid. 

12. A method of determining linkage of polymorphic 
forms in a target nucleic acid comprising: 

hybridizing a diploid target nucleic acid having 
first and second polymorphic sites to an array comprising a 

"pjort having a plurality of discrete regions, the different 
regions bearing different pools of probes, a pool of P-be 
Tom^rlsing first and second probes respectively complementary 
o Tol^lo'rphic forms of the first and second PO«; 
sit!s ^h7different pools comprising probes complementary to 
different combinations of polymorphic forms; 

determining binding of the target nucleic acid to 
the discrete regions to identify at least one discrete region 

nucleic acid bound by the discrete regions, the at least 
discrete region bearing a pool of probes respectively 
c : lelentary to a co^ination of polymorphic forms present in 
a single molecule of the diploid target nucleic acid. 

13 The method of claim 12. further comprising 
hybridizing a control mixture of a first nucleic acid having a 
pCor^hi! form at the first polymorphic site and a second 

, -/..oid having a polymorphic form at the second 
""^^LLhlc s^te and dete^ining hybridisation of th. mixture 
foretrcrete regions; determining binding - -J ^ 
region to the discrete regions; and comparing b.nd.ng of 
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8 target nucleic acid and control to the discrete regions to . 

9 identify a discrete region binding more strongly to the target 
nucleic acid than the control, this discrete region bearing a 
pool of probes respectively complementary to a combination of 
polymorphic forms present in a single molecule of the diploid 

13 target nucleic acid. 

^ 14. A method of sequencing a target nucleic acid. 

2 comprising: 

hybridizing the target nucleic acid to an array 
comprising a substrate having a plurality of discrete regions 
bearing different pools of probes, each pool having a common 
first probe and a different second probe, the common first 
probe complementary to a knovm marker in the target. 

determining a sequence of a segment of the target 
nucleic acid from the relative binding of the target nucleic 
acid to the pools of probes; and 

mapping the position of the segment in the target 
12 sequence relative to the knovm marker. 

1 15. A method of sequencing a target nucleic acid. 

2 comprising 

hybridizing the target nucleic acid to an array 
comprising a substrate having a plurality of discrete regions, 
different regions bearing different pools of probes, wherein 
the pools are subdivided into first and second subarray of 
pools, each pool in the first subarray of pools havxng a 
common first probe and a different second probe, each pool in 
the second subarray of pools having a common first probe 
complementary to a known marker in the target, and a different 
second probe, the common first probe in the first subarray of 
pools being complementary to a different known marker than in 
13 the second subarray of pools; 

, . determining a sequence of first and second segment 

15 of target nucleic acid from the binding of the target nucleic 

16 acid to the pools in the first and second subarrays; and 
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3^7 mapping the position of first and second segments in 

18 the target nucleic acid relative to the positions of the known 

19 markers. 

1 16 . A method of monitoring expression of an mRNA 

2 population/ comprising: 

3 providing a sample comprising a population of mRNA 

4 molecules; 

5 hybridizing the population of mRNA or nucleic acids 

6 copied therefrom to an array comprising a support having a 

7 plurality of 'discrete regions, the different regions bearing 

8 different pools of probes, a pool of probe comprising first 

9 and second probes respectively complementary to nonover lapping 

10 segments of a known mRNA molecule, the different pools 

11 comprising first and second probes complementary to 

12 nonoverlapping segments from different known mRNA molecules; 
3^3 determining which discrete regions show specific 

14 binding to the population thereby indicating which of the 

15 known mRNA molecules are present in the sample. 

1 17. The method of claim 16, wherein the support 

2 further comprises a second plurality of discrete regions, the 

3 different regions bearing different pools of probes, each pool 

4 having the same first and second probes except for a single 

5 base mismatch in the first or second probe or both as a 

6 corresponding pool from the plurality of discrete regions, and 

7 the method further comprises comparing binding of 

8 corresponding pools of probes from the plurality and second 

9 plurality of discrete regions, a difference in binding 

10 indicating that the known mRNA to which probes in the pool 

11 from the plurality of discrete regions are complementary is 

12 present in the sample. 

1 18. A method of analyzing a target nucleic acid, 

2 comprising: 

3 hybridizing a target nucleic acid to an array 

4 comprising a support having at least three discrete regions, a 

5 first region bearing a pool of polynucleotide probes 
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6 comprising first and second probes, a second region bearing 

7 the first probe without the second probe and a third region 

8 bearing the second probe without the first probe; 

9 comparing binding of the target nucleic acid to the 

10 first discrete region with the aggregate of the target nucleic 

11 acid binding to the second and third regions to determine 

12 whether the target nucleic acid includes segments 

13 complementary to the first and second probes. 
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